AIbase

# GUI agent

Internvl3 8B
Apache-2.0
InternVL3-8B is an advanced multimodal large language model with strong multimodal perception and reasoning capabilities. It can process multimodal data such as images and videos.
Multimodal Alignment Transformers
unsloth
224
1
Internvl3 1B GGUF
Apache-2.0
InternVL3-1B is an advanced multimodal large language model that excels at multimodal perception and reasoning. It also extends its multimodal capabilities to tool use and GUI agents.
Multimodal Fusion Transformers
unsloth
868
2
Internvl3 14B Hf
Other
InternVL3-14B is a powerful multimodal large language model that excels at multimodal perception and reasoning and accepts image, text, and video inputs.
Image-to-Text Transformers Other
OpenGVLab
4,260
0
Internvl3 8B
Other
InternVL3-8B is an advanced multimodal large language model with strong multimodal perception and reasoning capabilities. It performs well in areas such as tool use, GUI agents, and industrial image analysis.
Multimodal Fusion Transformers Other
FriendliAI
167
0
© 2025 AIbase